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DESCRIPTION 

METHOD AND SYSTEM FOR AUGMENTING AN AUDIO SIGNAL 

The present invention relates to a method and system for augmenting 
an audio signal in accordance with extracted features of said audio signal. 
The present invention has particular, but not exclusive, application with 
systems that determine and extract musical features of an audio signal such 
as tempo and key. 

It is known to augment an audio signal with a video signal. The most 
common occurrence of this is in the music video industry, where a particular 
song or album has a video sequence created for the music, usually for the 
purpose of broadcasting the combined audio and video via a television 
network. The video signal is created manually, using the experience and talent 
of a number of people to select and/or create suitable video sequences that 
best enhances the music. The skill of the people involved is a crucial factor in 
the quality of the end product. It is also the case that such augmenting is a 
long and expensive process, with no real guarantee on the suitability of the 
end product for a large number of users. 

It is therefore an object of the present invention to provide an improved 
method and system for augmenting an audio signal. 

According to a first aspect of the present invention there is provided a 
method for augmenting an audio signal comprising receiving an audio signal, 
extracting features from said audio signal, generating a time ordered table of 
dramatic parameters according to the extracted features, obtaining media 
fragments at least in part in dependence on the table of dramatic parameters, 
and outputting said media fragments. 

According to a second aspect of .the present invention there is provided 
a system for augmenting an audio signal, comprising . an input. . device for 
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receiving an audio signal and processing means for extracting features from 
said received audio signal, for generating a time ordered table of dramatic 
parameters associated with said extracted features, for obtaining media 
fragments at least in part in dependence on said generated table of dramatic 
i parameters, and at least one output device for outputting said media 
fragments. 

Owing, to the above, extracted music features from an audio signal such 
as key and tempo are associated with dramatic parameters representing an 
overview of a media fragment which may contain an audio/video data file. For 
example, a fast tempo may fit well with a media fragment having a chase 
scene that is high speed and furious. 

The dramatic parameters advantageously represent features of a story 
such as mood, pace, incidents and so on. The dramatic parameters are also 

mapped onto extracted features of the audio signal such as key and tempo. 

Advantageously, the media fragments are stored in a database from 
which they are obtained. Alternatively, the fragments may be generated at 
runtime in accordance with a story template which comprises a general 
description or narrative structure for a story, and appropriate dramatic 
parameters. 

For example, a romance story template may have certain requirements 
such as musical keys associated with happiness (major keys) or sadness 
(minor keys). A narrative arc describing the progression of a romance story is 
represented in the story template. One example arc for a romance story may 
comprise the following, a chance meeting, development of a relationship, a 
tragedy and a happy or sorrowful ending. 

Hence, it becomes possible to generate, obtain and render a story to a 
consumer, the story being based on a consumers music selection. 

For example, a consumer may select four music tracks which are 
analysed to determine for example genre (or the consumer may input a 
preferred genre) and musical features are extracted from the tracks. The 
features are employed to determine a closely matching story template that is 
then employed to obtain appropriate matching media fragments. For example, 
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media fragments having dramatic parameters indicating major keys (happy), or 
minor keys (sad) and which also contain romantic content may then be 
obtained to make up the story. 

The media fragments are then rendered to the consumer along with 
their music selection. Hence, a consumer undergoes a story experience in 
tandem with, and informed by the selected music. 

Advantageously, combinations of musical keys (perhaps a key change 
within a music track or a key change from music track to music track) may also 
be employed to determine dramatic moments such as a build up and 
subsequent release of tension. 

In an embodiment the system comprises a personal computer 
connected to a database storing media fragments and story templates. The 
computer is provided with program code, which extracts musical features from 
audio signals representing a musical selection, and also with program code 
representing a story building module, which selects templates and obtains 
appropriate media fragments in dependence on determined dramatic 
parameters related to the extracted musical features. The dramatic 
parameters are represented in a configurable meta-language such as 
extended Mark-up Language (XML). The computer program matches story 
templates and media fragments having associated dramatic parameter tags 
and renders the story to the consumer. 

Advantageously, the computer is part of a home network having output 
devices whereby lighting may be changed at identified dramatic moments, and 
further aspects of the experiential environment surrounding the consumer may 
be controlled to provide a compelling story telling experience in conjunction 
with musical selection. Physical Mark-up Language (PML) may be used to 
accomplish the aforementioned. 

Owing to the invention, it is possible to augment an audio signal, without 
the need for human input, in a way that nevertheless produces an 
augmentation that relates to and compliments the audio. 
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The present invention will now be described, by way of example only, 
and with reference to the accompanying drawings wherein: 

Figure 1 is a schematic representation of a system for augmenting an 
audio signal, 

Figure 2 is a table relating musical features to dramatic parameters, 
Figure 3 is an example of a generated list of dramatic parameters, 
Figure 4 is an illustrative example of stored story templates, 
Figure 5 illustrates a database storing media fragments, 
Figure 6 is a flow diagram of a method for augmenting an audio signal, 
Figure 7 is a flow diagram of a PML method for augmenting an audio 
signal. 

It should be noted that the Figures are diagrammatic and not drawn to 
scale. Relative dimensions and proportions of parts of these Figures have 
been shown exaggerated or reduced in size, for the sake of clarity and 
convenience in the drawings. The same reference signs are generally used to 
refer to corresponding or similar features "in modified and different 
embodiments. 

Figure 1 shows a schematic representation of a system 100 for 
augmenting an audio signal. The system 100 consists of a central processing 
unit (CPU) 102 connected to memory (ROM) 104 and memory (RAM) 106 via 
a general data-bus 108. Computer code or software 1 10 on a carrier i 12 may 
be loaded into RAM 106 (or alternatively provided in ROM 104), the code 
causing the CPU 102 to perform instructions embodying a method or methods 
according to the invention. The CPU 102 is connected to a store 114 and to 
output devices 1 16, 1 18. A user interface (Ul) 120 is provided. 

The system 100 may be embodied as a conventional home personal 
. computer (PC) with the output device 116 taking the form of a computer 
monitor or display. The store 1 14 may be a remote database available over a 
network connection (LAN or WAN). Similarly in a home network (LAN) the 
output devices 1 1 6, 1 1 8 may be distributed around the home and comprise for 
example a wall mounted flat panel display, computer controlled home lighting 




units, audio speakers placed around the home and so on. The connections 
between the CPU 102 and other devices 116, 118 may be wireless (for 
example communications via radio standards IEEE802.11, ZigBee 
IEE802.15.4 or Bluetooth) or wired (for example communications via wired 

5 standards Ethernet, USB), or a combination of both. 

The system 100 requires input of audio signals from which features are 
extracted. The extraction of musical features from audio signals is described 
in the paper "Querying large collections of music for similarity" (Matt Welsh et 
al, UC Berkeley Technical Report UCB/CSD-00-1096, November 1999) the 

10 .contents of which are hereby incorporated. The paper describes how features 
such as an average tempo, volume, noise, and tonal transitions can be 
determined from analysing an input audio signal. A method for determining 
the musical key of an audio signal is described in the United States patent 
US5,038,658 to Tsuruta et al, hereby incorporated and to which the reader is 

15 directed. 

Hence, in the system 100 audio signals are input to the CPU 102 by 
input device 122. The CPU 102 receives the audio signals and extracts 
musical features such as volume, tempo, and key as described in the 
aforementioned references. In this embodiment, the audio signal is provided 
20 via an internal input device 122 of the PC such as a CD/DVD or hard disc 
drive. Alternatively, the audio signals may be input via a connection to a 
networked home entertainment system (Hi-Fi, home cinema etc). 

Those skilled in the art will realise that the exact hardware/software 
configuration and mechanism of provision of input audio signal is not 
25 important, rather that such signals are made available to said system 100. 

Example data structures . and methods of operation will now be 
described to illustrate aspects of the present invention with regard to, for the 
sake of simplicity and by way of example only, a PC embodiment of system 
100. 

30 Figure 2 shows an example table 200 for use by computer program 

110. The table illustrates a non-exhaustive relationship of extracted musical 
features 204 (MF) with predetermined and defined dramatic parameters 206 
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(DP). For example, it is well known that in general music having a Major Key 
(MAJ key) is associated with happy or uplifting feelings, whereas music played 
in a Minor key (MIN key) tends to be more sorrowful and sad. 

Furthermore, using a circle of 5ths diagram (well known to those 
; studying music) it is possible to determine related musical keys, which are 
harmonious in that a transition from ens key to another is not alarming to a 
consumer. In general, a transition from one key to an adjacent key on the 
circle produces to a consumer an acceptable harmonious transition, whereas a 
leap from one key to a non-adjacent key on the diagram produces an audible 
clash to a user. For example, closely related (adjacent) keys of C Major/A 
Minor are F Major/d Minor and G Major/e Minor. Hence, a shift in key from say 
C Major (either within a music track or from track to track) to E Major (non- 
. adjacent) would result in an audible clash being heard from a user. Such 
knowledge is useful in developing rules for which music tracks are suitable for 
cross-fading in and out of each other in an automatic DJ application. 

Such knowledge may be used to indicate a dramatic change in a story 
(for example, a happy marriage scene to a sad tragic scene, -with the music 
adding to the story experience). Hence, in Figure 2 a musical feature 
corresponding to such a combination of non-adjacent key changes is 
represented by (NAK) with the dramatic parameter CLASH. The table also 
illustrates that a dramatic parameter of FAST or SLOW may be associated 
with an extracted TEMPO (beats per minute - bpm) result. For example a 
bpm of greater than 120 may be assigned a FAST dramatic parameter 
whereas a song or track with a slower tempo may be assigned a SLOW 
dramatic parameter. Hence, in a general way the mood, changes of pace and 
incidents according to a music selection may be determined and represented. 

The examples of dramatic parameters are for illustrative purposes only 
with many possible dramatic parameters in the form of mark up language tags' 
(realised in XML for example) or other attributes. For example, other musical 
features such as type of instrument (a cello may be assigned a SAD/SLOW 
attribute), and a high-level genre descriptive tag may be defined. Many audio 
CD or MPS archives have genre descriptors associated with a track or CD e g 




7 



PHGB020255 



rock, romantic, trance etc and these could be retrieved and assigned dramatic 
parameters, www. q racenote . com is an example music database and service 
provider wherein the music is categorised as rock, electronic/dance, jazz and 
so on. Electronic/dance music selected from such a service may be 

5 associated with a dramatic parameter <FUTURE> or <CYBER> which 
generaiiy indicates the genre of a story which may be suitable. 

By way of example we now illustrate how such tags are used to help 
build a story informed by music. A consumer, wishing to be entertained, 
selects three music tracks for playing, track 1 (TR1), track 2 (TR2) and track 3 

10 (TR3) on their PC. The audio signals of the tracks are input by input device 
122 to CPU 102, which determines and extracts musical features from the 
audio signals. The CPU 102, acting under instructions from computer 
program 110, correlates the determined musical features (MF) 204 with 
associated dramatic parameters (DP) 206 using table 200 (Figure 2) and 

15 subsequently builds a table 300 as shown in Figure 3. 

The table 300, ordered by time (to create a timeline TL 304), indicates 
that the CPU 102 has determined that track 1 begins with a FAST tempo and 
has an associated dramatic parameter HAPPY (i.e. a major key). Following 
that TR2 clashes (NAK feature determined) and is in a minor key (with a 

20 corresponding dramatic parameter SAD). Track 3, similar to track 1 has been 
determined as being generally FAST (high tempo), and in a major key 
(HAPPY). 

Hence, a high level overview of the music selection is created in which 
the music evolves dramatically as fast (TEMPO TR1), HAPPY (Major key), a 
25 clash (NAK and minor key indicating sadness TR2) and an increase in tempo 
and happy mood for track 3. Having constructed this high level dramatic 
overview, the CPU 102 then connects to store 114 to search for a data 
structure containing a story template with a roughly matching dramatic 
overview. 

30 Figure 4 shows illustrative examples of a collection of story templates 

constructed for use with system 100. The collection may be arranged as 
shown in the diagram according to genre (GEN 1, GEN 2, GEN 3). The genre 
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T indi ° ated ^ C ° nSUmer ( ™ Ul 120 > ° r by for 

examp le rea , ng descr| . p , or tags Qf ^ ^ ^ ^ V 

may for example represem simp|e chi|dren 6 art ; 

omance or any other story classification genre deemed as suiJTYe 
5 em pla e s 404 may also store other information such as how many pr'injp e 
cnaracters or pro teg on,s, s are suitable (perhaps only 3 are needed L a sTort 
romance story involving a love triangle). 

related^ 77 T^" *" "* d ~ 

the revelafion o, a oharaofer ,o develop. Henoe, Applicants have realised .ha 
mus,ca, transitions, or track changes may be used (via a NAK de^on fo 
» examp-e) to delineafe phases iof fhe story or screen ,ay. Hence^g re4 
story template 1 (ST,) has a simple narratfve structure in which drastic 
parameters indicafe a sfory arc in which for examp,e the fira, ac , is fas, I v 8 
onto happy act „ and finishes with a fas, scene or scenes for ao, Tin 

nS V W ,T may m such a ,emp,ate would be a 

■n wh,ch an.mal fnends race, have a party ,o celebrate the winner and 
subsequently race home. ' d 

ST2 however, has a dlfferen, story arc in which ac, II has dramafic 
parameters associated with sad and slow scenes. The variabilis, of .^X 
^ates is a funcion o, ,he number o, dramafic parameters H ca nZ 

its; mus ;v ea,ure s,ase 122 - Furth ™' -* 

store nformafion related to ,he length o, time of each ac (and hence of J 

rr,rX;r sho ; s,ories havin9 - - ™^ -i- - « 

Z! f " " S ° °" aVSra9e) Whi,st *>*• may have 

^ r acte w,«h more dramafic parameters DP to indicate scene ohangls 

dTe 2 Th ° Se eXPerien0ed in ' he art °< "*» and scripting ma y 
define Cher parameters and character ruies appertaining to story temp.ates 





In operation, CPU 102 compares the DP of the story templates with 
those determined from the music selection (Figure 3) and selects a closely 
matching template. For example, comparison of the DP determined in Figure 
3 with ST2 of Figure 4 reveals a match and so ST2 is selected. Having 

5 selected a story template 404 the GPU 102 must then populate the template 
ST2 wiih actuai story content. 

Figure 5 shows an example store 1 14 in the form of a database 500 that 
stores a multiplicity of media fragments 502. A media fragment is a data entity 
comprising for example an encoded segment of film (sometimes referred to as 

10 a shot and wherein several shots make up a scene). Hence the media 
fragment has a certain associated time duration when played. The fragments 
502 are characterised by tagged attributes relating to dramatic parameters. 

The media fragment may be stored as, by way of example an XML 
record comprising a description of the audio/visual content of the scene (for 

15 example genre-Action, act 1, opening car chase, 4 minutes) and dramatic 
parameters associated with the content of the scene. Of course, in creating 
the fragments an overall story may be developed with several different endings 
and openings (for example, a story may start with a car, or bicycle chase 
scene) each suitable for different dramatic parameters (TEMPO). This 

20 enables some foreshadowing of the story to be maintained whilst still enabling 
different fragments to be shown in dependence on determined dramatic 
parameters of the music. Similar scenes having different durations may also 
be created. 

Hence, the computer program instructs CPU 102 to search the 
25 database for media fragments 502, which roughly match in terms of duration, 
act and dramatic parameters the selected story template. The fragments are 
obtained, ordered and cached in memory 106 (or hard disk drive), and 
subsequently output with the music. The database 500 may also optionally 
store, for convenience, the story templates 400 as illustrated in Figure 5. 
30 For example, with reference to the DP list generated in Figure 3 and the 

selected story template (ST2) of figure 4, and wherein a children's genre has 
been indicated (perhaps by the child itself via Ul 120). Story fragments having 
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a children's genre tag are searched to match the determined timeiine (TL) and 
dramatic parameters (DP). Suppose track , (TR1) iasts for 4 minutes, then an 
opemng media fragment is searched for which has a DP combination (FAST- 
HAPPY) and which lasts, for roughly 4 minutes. Simiiarly, a media fragment 
5 related ,o the first selected fragment) for act I, having a siower pace and a sad 
theme ,s searched for and which win ,as. roughiy the durafion of track 2 
Ftnaliy media fragments tagged as having a fast/happy ending and related to 
the first fragment are selected for rendering with track 3 

Hence, having identified and obtained suitable media fragments 502 
) the story is output on, in this PC embodiment, the display monitor 116 

Frgure 6 illustrates a flowchart showing the steps described above In 
step 602 an audio signal representing the music selection is input (IAS) and 
muni features extracted (EMF) in step 604. A dramatic parameter lis, 
charactensrng the music is generated (GEN DP, in step 606 following which In 
step 608 a story template is selected (ST, and used in conjunction with ,he 

s,r R a n T meterS ,0 OWain aPProPria * e me * a " a ~ < OMF >- ■*«* - 
step 610, the story is output (O/P, togetherwith the music selection 

Those skilled in the art win realise that the story may not be contiguous 
Content ,n the fonm of stylised comic book frames may be employed within 
medra fragments, hence easing the „me duration matching requirements and 
enablrng easier matching of scene output with identified dramatic moments in 
the selected music. Such "pre-canned» comic, cartoon or computer generated 
vrsua content may be relatively quickly created and indexed, enabling quick 
establrshmen, o, a substantia, database having many story templates and 
medra fragments available. The fragments may be stored in any convenient 
ashron for example as MPEG files linked with records indexing the files in 
terms of dramatic parameters, genre, characters and so on. 

in the above example, pre-canned content was retrieved for a user in 

3 "** ,emP ' a,e h3Vin9 a Wmeline W matched the 
dura on of the music selection. However, having selected a particular story 
template, the media fragments may be computer generated a, runtime by 
processor ,02. For exempts, a media fragment may contain computer 
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generated content in the form of a simple cartoon for a child. The matching of 
the fragments with the timeline of the music selection is then relatively simple 
since processor 102 generates the actual media fragment and its content. Of 
course, the need for story templates may be obviated in such a case, with 

5 simple rules (ACT 1, ACT 2, ACT 3, DP- HAPPY, FAST, HAPPY) aiding the • 
generation of the media fragments making up the story. 

In another embodiment illustrating advantageous aspects of an ambient 
intelligence system, the module represented by dashed outline 124 in Figure 1 
is implemented as a home network plug-in module 124 with network links to 

10 various output devices 116, 118 which are compliant with a mark-up language 
and instruction set designed to deliver real world experiences. An example of 
such a physical mark-up language (PML) is described in Applicants co- 
pending application WO 02/092183. PML comprises a means to author, 
communicate and render experiences to an end user so that the end user 

is experiences a certain level of immersion within a real physical space. For 
example, PML enabled consumer devices such as a HI-FI and lighting system 
can receive instructions from a host network device (the instructions may be 
embedded within a DVD video stream for example) that causes the lights or 
sound output from the devices to be modified. Hence a dark scene in a movie 

20 causes the lights in the consumer's home to darken appropriately. 

PML is in general a high level descriptive mark-up language which may 
be realised in XML with descriptors which relate to real world events (<dim 
light>). Hence, PML enables devices around the home to augment an 
experience for a consumer in a standardised fashion. 

25 Figure 7 illustrates an example PML flowchart embodying aspects of the 

present invention. Audio signals are input (IAS) 700 and analysed for music 
features (EMF) 702. The features, are provided to the story building module 
124 which generates a dramatic parameter table and timeline as previously 
described. The module 124 selects a story template 400 in dependence on 

30 the generated table and obtains suitable media fragments 502. The fragments 
(which may be pre-canned or runtime generated) include PML descriptors as 
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well as dramatic parameters. The fragments are then assembled by the 
module 124, and provided to a PML interpreter (PMLI) 704. 

Hence, PML commands associated with the fragments are interpreted 
and subsequently directed to PML enabled devices 706, 708, 710 on the home 
; network. For example, devices 706, 708 may represent a PML lighting system 
and a PML projector, i he devices respond to PML commands so as to for 
example, project changing lighting levels and colours on a wall at certain 
moments in the narrative and audio experience associated with the selected 
mus.c and obtained media fragments. Other devices 710 such as desktop 
fans, or air conditioning and heating systems may be controlled in dependence 
on the dramatic features in order to further augment the input audio signals 
and output story experience. 

In the above a system suitable for augmenting audio signals by 
prov,ding a narrative story experience was described. The system may be in 
the form of a dedicated module 124, or a general purpose computer, operating 
alone or in tandem with other devices on a consumer network. Extracted 
musical features such as key, key changes, tempo and so on are input to the 
computer which then relates the features to dramatic parameters. Appropriate 
med.a fragments are then obtained and subsequently output and rendered to a 
user. Other output devices may be used to further enhance and augment the 
story. 

From reading the present disclosure, other modifications will be 
apparent to persons skilled in the art. Such modifications may involve other 
features which are already known in the design, manufacture and use of 
mult,media system and component parts thereof and which may be used 
.nstead of or in addition to features already described herein without departing 
from the spirit and scope of the present invention. Furthermore, narrative 
structures and media fragments may vary from the explicit examoles herein 
described without departing from the spirit and scope of the present'invention 
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CLAIMS 



1 . A method for augmenting an audio signal comprising: 

• receiving an audio signal 

5 * extracting features from said audio signal, 

• generating a time ordered iabie of dramaiic parameters 
according to the extracted features, 

. • obtaining media fragments at least in part in dependence on the 
table of dramatic parameters, and 
io • outputting said media fragments. 



2. A method according to claim 1 , wherein said features extracted 
from said audio signal include one or more of tempo, key, volume. 

15 3. A method according to claim 1 or 2, wherein the generation of 

said table of dramatic parameters comprises retrieving a list of dramatic 
parameters and associated audio features, comparing and matching the 
extracted features with the retrieved associated audio features, and inserting 
an entry comprising the dramatic parameter associated with the audio feature. 

20 

4. A method according to claim 1, 2 or 3, wherein said dramatic 
parameters include mood, changes of pace, incidents. 

5. A method according to any preceding claim, wherein said 
25 obtaining said media fragments comprises selecting a fragment from a store, 

said fragment being stored with an associated dramatic parameter which 
matches the respective entry in the table of dramatic parameters. 

6. A method according to any preceding claim, wherein said 
30 obtaining said media fragments comprises generating a fragment. 
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7. A method according to claim 5 or 6, and further comprising 
receiving user input, said user input affecting said obtaining. 

8. A method according to any preceding claim, wherein said media 
fragments include video data. 

9. A method according to any preceding claim, wherein said 
outputting comprises storing said media fragments and said audio signal. 

10. A method according to any preceding claim, wherein said 
outputting comprises rendering said media fragments and said audio signal. 

11. A method according to any preceding claim, and further 
comprising, prior to obtaining said media fragments, selecting a story template 
at least in part in dependence on said table of dramatic parameters, said story 
template affecting said obtaining of media fragments. 

12. A method according to claim 11, wherein said story template 
comprises dramatic parameter data related to a narrative story structure. 

13. A method according to claim 12, wherein the selection of media 
fragments comprises matching the dramatic parameters of the selected story 
template with those of the media fragments. 

14. A method according to claim 11, 12 or 13, wherein the story 
template for selection is generated according to logical story structure rules 
and the dramatic parameter list. 

15. A method according to any preceding claim, wherein the 
dramatic parameters are represented by physical mark up language tags. 
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16. A method according to any preceding claim, wherein 
combinations of extracted features have associated dramatic parameters. 

17. A system for augmenting an audio signal, comprising an input 
5 device for receiving an audio signal and processing means for extracting 

features from said received audio signal, for generating a time ordered table of 
dramatic parameters associated with said extracted features, for obtaining 
media fragments at least in part in dependence on said generated table of 
dramatic parameters, and at least one output device for outputting said media 
10 fragments. 



18. A system according to claim 17, further comprising storage for 
storing said media fragments. 

15 1 9. A system according to claim 1 7 or claim 1 8, wherein said at least 

one output device comprises display means on which said media fragments 
are displayed. 

20. A system according to any one of claims 17, 18 or 19, wherein 
20 said at least one output device is responsive to instructions associated with 

said dramatic parameters. 

21 . Program code on a carrier which when executed by a processor 
cause said processor to perform any of the methods of claims 1 to 16. 

25 - • 

22. A database for use with the system of claim 17, said database 
storing media fragments with associated dramatic parameters. 
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METHOD AND SYSTEM FOR AUGMENTING AN AUDIO SIGNAL 

A system and method is described in which a multimedia story is 
rendered to a consumer in dependence on features extracted from an audk> 
signal representing for example a musical selection of the consumer. Features 
such as key changes and tempo of the music selection are related to dramatic 
parameters defined by and associated with story arcs, narrative story rules and 
f.lm or story structure. In one example a selection of a few music tracks 
provides input audio signals (602) from which musical features are 
extracted(604), following which a dramatic parameter list and timeline are 
generated (606). Media fragments are then obtained (608), the fragments 
having story content associated with the dramatic parameters, and the 
fragments output (610) with the music selection. 
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